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Hyaluronidases (HAases), particularly leech HAases, have attracted intense attention due to their broad 
applications in medical treatments and great potential for the enzymatic production of hyaluronan 
oligosaccharides. However, little is known about this third interesting family of HAases. Here, we applied 
the random amplification of cDNA ends polymerase chain reaction (RACE-PCR) approach to identify the 
first leech HAase-encoding gene. By combining protein engineering and high-density culture, we achieved 
high-level production (8.42 X 10 5 U ml" 1 ) in the yeast Pichia pastoris secretory expression system. 
Compared with the commercial bovine testicular HAase, the recombinant leech HAase exhibited superior 
enzymatic properties. Furthermore, analysis of the hydrolytic process suggested that this novel enzyme 
adopts a nonprocessive endolytic mode, yielding a narrow-spectrum of specific HA oligosaccharides with 
different incubation times. Large-scale production of this novel leech HAase will not only greatly promote 
medical applications but also facilitate the enzymatic production of specific HA oligosaccharides. 



Hyaluronic acid (HA) is a linear and unbranched high-molecular weight polysaccharide composed of 
repeating disaccharide D-glucuronic acid (GlcUA) and N-acetyl-D -glucosamine (GlcNAc) units linked 
through (3-1,4 bonds (Supplementary Fig. SI). High-molecular- weight HA is widely distributed among 
various host tissues and participates in numerous physiological processes 1 , and the biological functions and 
applications of HA depend on its molecular mass 2 ' 3 . In particular, low- molecular- weight HA oligosaccharides 
have unique biological activities 3 . Smaller HA oligosaccharides can stimulate fibroblast proliferation and collagen 
synthesis 4 and can selectively kill many types of cancer cells via disruption of the receptor-hyaluronan inter- 
action 5 " 7 . HA octasaccharides (HA8) and decasaccharides (HA 10) have significant suppressive effects on cancer 
cells 8 , and HA tetrasaccharides (HA4) and hexasaccharides (HA6) can induce dendritic cell maturation via Toll- 
like receptor (TLR)-4 associated with the antigen -presenting cells of the innate immune system 9 . In addition, 
lower- molecular- weight HA oligosaccharides are easily absorbed by the body and serve as precursors for the 
synthesis of higher-molecular-weight HA molecules and other substances. Thus, a specific narrow spectrum of 
HA oligosaccharides could have broad applications in medicine, food and cosmetics. Low- molecular- weight HA 
is mainly produced by the degradation of high-molecular- weight HA by physical and chemical methods 10 . 
However, the products of these methods have a broad range of molecular weights (> 3,000 Da), making it difficult 
to obtain HA oligosaccharides with specific molecular weights. Many chemical approaches have been developed 
for the de novo synthesis of HA oligosaccharides 7 . However, these complex processes are time-consuming, 
carbohydrate oligosaccharide backbones are rare, and the substrate uridine diphosphate (UDP) -sugars are 
expensive, limiting the applications of these synthetic methods in large-scale production 11 . In contrast, the 
enzymatic production of HA oligosaccharides with a well-characterised hyaluronidase (HAase) is promising 
and attractive because of its unique advantages, such as mild operation conditions, high degradation rates and 
high product uniformity 12 . 

HAases (also previously referred to as "spreading factors") are a large class of glycosidases that predominantly 
degrade HA. Based on substrate specificities and hydrolysis products, HAases are commonly grouped into three 
families (Supplementary Fig. SI) 13 : hyaluronate lyases (EC 4.2.2.1, Streptococcus hyaluronate lyase), hyaluronate 
4-glycanohydrolases (EC 3.2.1.35, Bovine testicular hyaluronidase, BTH) and hyaluronate 3-glycanohydrolases 
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Figure 1 | Sequence comparison, structural homology modelling and phylogram of LHyal. (a) Multiple sequence alignment of LHyal. The figure was 
generated using the CLUSTALW 1.8 software package based on the known heparanases from Homo sapiens (BAG52128), Ceratotherium simum simum 
(XP_004430958), Rattus norvegicus (NP_072127) and Mus musculus (NP_690016). The conserved residues are marked with '*' and boxed in yellow, with 
the exception of the two predicted key catalytic residues, which are coloured red. Semi-conserved sites are boxed in pale blue. The secondary structure 
predicted by the I-TASSER server is indicated by green arrows to represent P-strands and red cylinders to represent a-helices. (b) Cartoon diagrams of the 
LHyal 3D structural model. The overall model was constructed using the I-TASSER server based on homology modelling utilising the recently solved 
crystal structure of (3-glucuronidase (PDB code: 3VNY) from Acidobacterium capsulatum. The main catalytic domain is an overall regular ((3/a) 8 
TIM barrel fold (Apl-4 surrounded by aHl -8, shown in a), which is typical of glycoside hydrolases. The secondary structure elements in the model are 
colour-coded (helices in red, p-sheets in yellow, others in green). The deduced catalytic residues, Glul72 and D 170, are shown in stick form in purple and 
blue, respectively (labelled), (c) Cartoon diagrams of the bee venom HAase crystal structure (PDB code: 1FCV). The main catalytic domain is shown as an 
overall distorted (P/a) 8 TIM barrel fold. The active-site residues Glull3 and Asplll are coloured in purple and blue, respectively, (d) Phylogenetic 
analysis of the LHyal sequence. The phylogenetic tree of HAase family proteins was constructed using the neighbour-joining (NJ) method. The data are 
presented as an NJ tree based on the amino acid sequences of LHyal and representative glycoside hydrolases: eight related mammalian HAases, four 
microbial HAases and three representative glycoside hydrolases from family 79. The lengths of the nodes represent the substitution rate, which is defined 
as the percentage of substitution sites per alignment length. Bootstrap values >50% are shown on the branch points. One thousand bootstrap replications 
were performed using the software MEGA5.0 53 . 



(EC 3.2.1.36, Leech HAase). Commercial BTH has been widely used 
in clinical medicine, and its hydrolysis mechanism has been studied 
extensively 14 . The disadvantages of the enzymatic production of spe- 
cific or narrow- spectrum HA oligosaccharides by BTH include the 
limited source material (bovine testes), its considerably high price 
and the broad range of degradation products 15 . Compared with BTH 
and Streptococcus hyaluronate lyase, leech HAase has higher sub- 
strate specificity and a narrow- spectrum of enzymatic products 16,17 . 
In addition, the use of recombinant leech HAase does not pose any 
risk of animal cross -infection. Therefore, high-level production of 
recombinant leech HAase would be of great significance for both 
clinical medical treatment (such as surgery, ophthalmology and 
internal medicine) and producing narrow- spectrum HA oligosac- 
charides at the industrial scale. However, no gene sequence has been 
identified and reported for the EC 3.2.1.36 group, which includes 
leech HAase. 

In this work, we cloned and characterised the first leech HAase- 
encoding gene, LHyal. Through a combination of N-terminal engin- 
eering and fed-batch fermentation strategies, we achieved high-level 
production of leech HAases in recombinant Pichia pastoris. More 
importantly, by exploring the hydrolytic process and mechanism, we 
achieved enzymatic production of narrowband HA oligosaccharides. 



The construction of robust microbial factories for the large-scale 
production of recombinant HAase and the development of the 
enzymatic approach for the preparation of specific HA oligosacchar- 
ides would greatly promote progress in related research areas, such as 
the chemical synthesis of oligosaccharides and cancer therapy. 

Results 

In silico characterisation of a potential HAase from leech. After 
analysing the expressed sequence tag (EST) databases of Hirudo sp., a 
potential HAase gene (designated LHyal) (deposited under accession 
number KJ026763) encoding 489 amino acids (Fig. la) was amplified 
from leech cDNAs after three rounds of random amplification of 
cDNA ends polymerase chain reaction (RACE-PCR). Basic Local 
Alignment Search Tool (BLAST) analysis of the deduced amino 
acid sequence suggested that LHyal contains a putative conserved 
glycosyl hydrolase family 79 N-terminal domain. Further multiple 
sequence alignment (MSA) analysis using CLUSTALW demon- 
strated that LHyal has low sequence identity with other well- 
characterised HAases (Supplementary Fig. S2) and exhibits higher 
sequence identity (approximately 35%) with heparanases (EC 
3.2.1.166) (Fig. la). The motif 168 NIDWELGNEP 177 is highly 
conserved, with the exception of the residue D, which is replaced 
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Figure 2 | Analysis of the functional activity and protein expression level of recombinant LHyal. (a) Determination of HAase activity using a typical 
plate assay. Cylindrical holes were injected with the fermentation of LHyal-expressing (left) or control P. pastoris GS115 (right). The clear circle 
corresponds to the distance diffused by LHyal, while the control (undigested HA) precipitated in the gel upon the addition of cetylpyridinium chloride, 
(b) Effects of various length N-terminal His-tags on LHyal expression. The gene copy numbers of all recombinant strains were screened with G418 (3 mg 
ml -1 ) on YPD plates. Typical time course of recombinant LHyal (C) and H6LHyal (d) expression by P. pastoris in a 3-L fermenter during the methanol 
induction phase. LHyal production (red U ml" 1 ) and cell growth density (green OD 600 value) were measured at regular intervals. SDS-PAGE 
analysis of LHyal (e) and H6LHyal (f) expression in the 3-L fermenter at regular intervals during the methanol induction phase. The protein production of 
LHyal and H6LHyal is consistent with the enzymatic activity shown in Fig 3c and 3d, respectively. Black arrows indicate the target protein. 



by S in all heparanases. Interestingly, the central tetrapeptide 
169 IDWE 172 motif, which contains the putative active site acidic 
residues D 170 and E 172 , are conserved in all HAases (Supplementary 
Fig. S2). A neighbour-joining tree was constructed to uncover the 
phylogenetic relationship among the hyaluronidases and 
heparanases (Fig. Id). LHyal is close to the clades of heparanases 
and P- glucuronidase and distant from the clades of mammalian 
HAases, in agreement with previous speculations that heparin-like 
polymers were the first ancestral glucosaminoglycans in metazoan 
life and that leech HAase possibly evolved to acquire specificity for 
HA, losing its ancestral heparanase activity 18 . Both the MSA and 
phylogenetic tree analysis suggested that LHyal should be a 
potential endo-p-glucuronidase (family 79), consistent with a 
previous classification 13 . 

The mechanism of action of the third group of endo-p-glucuro- 
nidases resembles that of the eukaryotic or vertebrate hyaluronate 4- 
glycanohydrolases more closely than that of the bacterial hyaluronate 
lyases 18 . Accordingly, a 3D model of LHyal was built based on the 
recently solved crystal structures of P- glucuronidase (PDB: 3VNY) 
from Acidobacterium capsulatum 19 using the widely used server I- 
TASSER, which depends on multiple-threading alignments and iter- 
ative template fragment assembly simulations 20 . The overall fold 
model suggests that, similar to the well-characterised bee venom 
hyaluronidase (bvHyal) and human Hyaluronidase- l(hHyal- 
l) 21 " 23 , LHyal consists of two distinct structural domains: a potential 
catalytic domain and a small P- domain (Fig. lb). Similar to bvHyal 
(Fig. lc) and hHyal-1 23 ' 24 , the potential catalytic domain of LHyal 
adopts a classic and regular (p/a) 8 barrel fold and contains the resi- 
dues Glul72 and Asp 170, which are conserved in hyaluronan 
catalysis 23 and are also common to many hydrolases 22 . The small 
P-domain adapts a typical antiparallel structure consisting of six P- 
strands (1 N-terminal p-strand and 5 successive C-terminal p- 
strands, Fig. lb); this structure is also commonly observed in many 
glycoside hydrolases 25 . 



High-level production of functional recombinant LHyal. To 

prepare to investigate the properties of the potential hyaluronidase 
LHyal, P. pastoris GS1 15, a widely used eukaryotic expression system 
with unique advantages (such as high-density growth, endotoxin- 
free culture supernatant, secretory expression for easy purification 
and glycosylation) 26 " 28 , was selected for the heterologous expression 
of LHyal. The ot-factor signal peptide from Saccharomyces cerevisiae 
was fused with the LHyal sequence to enable secretory expression. 
Flask cultivation demonstrated that LHyal was successfully 
expressed and secreted into medium with a final HAase activity of 
11954 U ml" 1 (Fig. 2b). A visual HA plate assay was also performed 
to verify and confirm this HAase activity. HA hydrolysis by the 
culture supernatant produced a clear transparent zone (Fig. 2a), 
strongly demonstrating that leech LHyal is an HAase and can be 
functionally overexpressed in P. pastoris. 

His-tags have been extensively applied for recombinant protein 
expression (generally as a 6xHis-tag) to simplify and facilitate the 
purification process. Four His-tags of different lengths were fused to 
the N-terminus of LHyal to determine whether different His-tags 
would influence the protein expression level. As expected, fusion of 
the different His-tags resulted in the distinct accumulation of extra- 
cellular HAase (Fig. 2b) in the flask cultures. In particular, fusion 
with the 6XHis tag (H6LHyal) significantly increased HAase activ- 
ity, with a titre of 63180 ± 460 U ml" 1 . To further improve the 
expression level, high-cell-density cultivation (HCDC) was con- 
ducted in a 3-L bioreactor to evaluate the production of recombinant 
LHyal and H6LHyal. As expected, LHyal production increased 
remarkably to a high value of 8.50 X 10 4 U ml" 1 (Fig. 2c), a 7.1-fold 
increase compared to the flask cultures. In contrast, production of 
the recombinant H6LHyal fused to the 6XHis tag was dramatically 
improved to 8.42 X 10 5 Uml" 1 (approximate 420 mgL" 1 ) (Fig. 2d), 
an extremely large increase. In parallel, the accumulation of the 
recombinant LHyal and H6LHyal was monitored by sodium dodecyl 
sulphate polyacrylamide gel electrophoresis (SDS-PAGE) analysis at 
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Figure 3 | Biochemical properties of LHyal. (a) Purification of the H6LHyal protein using Ni-NTA agarose. M = Marker proteins, C = fermentation 
supernatant of the control strain P. pastoris GS115, FS = fermentation supernatant of H6LHyal, PE = purified H6LHyal. (b) Effect of temperature on 
LHyal activity. The enzymatic activity of LHyal was measured at various temperatures in 50 mM citrate buffer (pH 5.5) using HA (1.6 mg ml" 1 ) as the 
substrate. The logarithm (Ln) of the specific activity (U mg -1 ) was plotted against the reciprocal of the absolute temperature ( T). The activation energy 
(E fl , kj mol" 1 ) was calculated from the value of the slope (S) and the general gas constant (R, 8.3 J mol" 1 K) according to the Arrhenius equation: E a = — S 
X 2.306 X R. (c) After pre-incubating the purified enzyme (0.45 mg ml" 1 ) at 38°C (red Y), 50°C (purple A), 55°C (blue ♦) or 60°C (green ■) for 
up to 180 min, the remaining enzymatic activity was determined at 38°C in 50 mM Na 2 HP0 4 -citrate buffer (pH 5.5). (d) Effect of pH on LHyal activity 
and stability. The enzyme activity was determined at various pH values at 38°C. The stability was assayed at 38°C in 50 mM Na 2 HP0 4 -citrate buffer (pH 
5.5) after pre-incubating the purified enzyme (0.45 mg ml" 1 ) for 30 min at 25°C in buffers of pH values from 3.0-8.0. The relative activity is represented 
as a percentage of the maximum activity. (O) 50 mM Na 2 HP0 4 -citrate buffer (pH 3.0-6.0); ( O ) 50 mM Na 2 HP0 4 -NaH 2 P0 4 buffer (pH 6.0-8.0). 
Analysis of the degradation of chitin (e), heparin (f) and chondroitin sulphate (g) by HAases using the DNS method. In the control group, the 
corresponding substrates (2 mg ml" 1 ) were incubated without enzyme in 50 mM citrate buffer (pH 5.5) at 38°C for 30 min. In the LHyal and BTH 
groups, substrate was incubated with LHyal (1,000 U) or bovine testicular hyaluronidase (BTH, 1,000 U) under the same treatment conditions. 



regular intervals. Compared with LHyal (Fig. 2e), the protein bands 
corresponding to recombinant H6LHyal increased in intensity with 
prolonged cultivation time (Fig. 2f). Thus, fusion with 6 His residues 
at the N-terminus increased the expression level, resulting in much 
higher HAase activity. 

Enzymatic properties of the recombinant H6LHyal. The extracel- 
lular H6LHyal was first enriched using a Ni-NTA affinity column. 
After washing away the background proteins with increasing 
concentrations of imidazole and subsequent dialysis, the purified 
protein, which had a specific activity of 1.20 X 10 6 U mg" 1 , was 
analysed by SDS-PAGE. A clear band with an apparent molecular 
weight of 58 kDa (Fig. 3a) was observed, consistent with the value of 
58,684.0 Da (Supplementary Fig. S6) observed by matrix- assisted 
laser desorption/ionisation quadrupole time-of-flight mass spec- 
trometry (MALDI Q-TOF MS). The optimum temperature for 
H6LHyal activity was 45 °C, and the activity decreased significantly 
above 55°C (Fig. 3b). The activation energy of H6LHyal was further 
calculated from an Arrhenius plot, which yielded a value of 23.90 kj 
mol" 1 (Fig. 3b). This value is much lower than that of most other 
hydrolases, indicating the high HA degradation efficiency of 
H6LHyal. H6LHyal was stable for 2 h when incubated at 55 °C but 
lost 100% of its activity after incubation for 30 min at 60°C (Fig. 3c). 



Interestingly, a thermal activation phenomenon was observed when 
H6LHyal was incubated at temperatures below 60°C. H6LHyal 
exhibited an optimal pH of 6.5 and remarkable stability over a 
broad pH range (3.0 to 8.0) (Fig. 3d). H6LHyal was significantly 
inhibited by Mn 2+ , Cu 2+ and Fe 3+ (Supplementary Table S3). The 
substrate specificity of H6LHyal was compared with that of BTH. In 
addition to HA, BTH can also hydrolyse chondroitin sulphate 
(Fig. 3g), while H6LHyal cannot hydrolyse chitin, heparin or 
chondroitin sulphate (Fig. 3e-3g), validating a previous hypothesis 
that the substrate specificity of LHyal differs from that of BTH 29 . 
Specifically, no differences in temperature stability, metal ion 
sensitivity, pH stability, or substrate specificity were observed 
between recombinant H6LHyal and LHyal (data not shown). 

Exploration of the hydrolysis process. The hydrolysis mechanisms of 
the bovine testicular HAase and bacterial HAase have been extensively 
studied and reviewed 14 ' 29 . However, although different leech HAases 
have been extracted and described 1630 , the hydrolytic process has 
remained unclear. The high-level production of recombinant 
H6LHyal enabled us to explore the hydrolytic process in detail. 
Under normal conditions (Methods), the degradation products 
obtained at different incubation times (1, 2, 4 and 8 h) were 
analysed by hybrid ion trap time-of-flight liquid chromatography/ 
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Figure 4 | LC-MS-IT-TOF profile of LHyal-catalysed HA hydrolysis. The reaction mixture, which contained 1.6 mg of HA and 640 U of HAase per ml in 
50 mM citrate buffer (pH 5.5), was incubated at 38°C for 1, 2, 4 or 8 h. At each time point, samples were withdrawn, and the reaction was terminated by 
immersion in boiling water, followed by LC-MS-IT-TOF analysis. The total ion chromatograms of the 1-h (a), 2-h (b), 4-h (c) and 8-h (d) reactions are 
shown. The ion chromatograms of HA tetrasaccharide (HA4), hexasaccharide (HA6), octasaccharide (HA8) and decasaccharide (HA10) are shown in 
blue, pink, red and green, respectively. The mass spectra of the 1-h (e), 2-h (f), 4-h (g) and 8-h (h) reactions are shown. The anion m/z of 775.22 [M-H] " 
(HA4), 576.66 [M-H] 2 " (HA6), 766.22 [M-H] 2 " (HA8) and 955.77 [M-H] 2 " (HA10) are marked with blue, pink, red and green, respectively, 
(d) Schematic representation of the putative hydrolytic process catalysed by LHyal. LC-MS-IT-TOF, which couples atmospheric pressure ionisation with 
Ion-Trap (IT) and Time- of- Flight (TOF) technologies and delivers high mass accuracy (ppm) and high mass resolution (10,000 at 1,000 m/z) 
independent of the MS mode, was employed to analyse the hydrolysis of HA oligosaccharides during the degradation process. Analysis conditions and 
parameters: electrospray ion source; negative-ion acquisition mode; interface voltage of — 3.50 kV; nebuliser gas flow of 1.5 1 min" 1 ; CDL temperature of 
200°C; block heater temperature of 200°C; detector voltage of 1.60 kV; drying gas at 100 kPa; ion accumulation time of 10 msec; MS range of m/z 300 to 
1,500; and MS2 or MS3 ranges of m/z 50 to precursor ion mass. The CID parameters were energy of 50%, collision gas at 100% and 10 repeats. MS data 
were processed with LCMS solution ver. 3.4 software (Shimadzu). 



mass spectrometry (LCMS-IT-TOF) in negative ion mode (Supple- 
mentary Table S2). Surprisingly, after 1 h of incubation, no small HA 
oligosaccharides (less than 3,000 Da) were detected (Fig. 4a, 4e), in 
contrast to results observed for other groups of HAases. Further 
investigation revealed that all high-molecular -weight HAs were 
rapidly degraded to low-molecular-weight fragments consisting of 
approximately 30 saccharide units with an average molecular weight 
of 2.60 X 10 4 Da (Supplementary Fig. S5). At 2 h, two prominent ion 
peaks (Fig. 4b) corresponding to the decasaccharide (HA10) and the 
octasaccharide (HA8) were detected and identified (Fig. 4f); no larger 
ion peaks corresponding to other oligosaccharides were detected. 
When the incubation was extended to 4 h, two new ion peaks 
corresponding to the hexasaccharide (HA6) and the tetrasaccharide 
(HA4) appeared, while the peak corresponding to HA10 disappeared 
(Fig. 4c, 4g), indicating that HA 10 was completely digested to HA6 
and HA4. Finally, only two peaks corresponding to HA6 and HA4 
were detected (Fig. 4d, 4h) after 8 h of digestion. The final products, 
HA6 and HA4, were further analysed and confirmed by multistage 
mass spectrometry (Supplementary Fig. S7). Consistent with a 
previous report 17 , the Moran-Elson reaction revealed that the HA 
oligosaccharides contained glucuronic acid at the reducing end 31 
(Supplementary Fig. S3). Interestingly, no disaccharide (HA2) was 
detected throughout the degradation process. For comparison, we 
also investigated the hydrolysis products of BTH. In agreement with 
previous reports, HA2 was generated at the beginning of hydrolysis 
(Supplementary Fig. S4a) and the oligosaccharides HA2, HA4, HA6, 
HA8, HA10, HA12 and HA14 were all simultaneously detected after 
complete degradation (Supplementary Fig. S4b), demonstrating that 



the HA degradation processes catalysed by BTH and H6LHyal are 
quite different. 

Discussion 

HAases have been the subject of intense research due to their con- 
siderable potential for use in biotechnological processes, particularly 
the pharmaceutical industry. The discovery of new HAases with 
novel properties is thus an important research area 14 . Although dif- 
ferent types of the third family of HAases, leech HAases, have been 
isolated and biochemically characterised 32 , no sequence data are 
available, and little is known about this potentially interesting class 
of enzymes 18 . In this work, we cloned a novel leech HAase designated 
LHyal. In silico analysis demonstrated that LHyal should be classified 
in glycosidase family 79 and is highly homologous to heparanases, 
indicating a common ancestor of these enzymes 33 . Although LHyal is 
quite different from the two other groups of HAases, the classic 
(p/a) 8 barrel fold and the two amino acid residues (D 170 and E 172 ) 
that serve as the nucleophile and acid/base catalysts that are present 
in human Hyals-1 and PH-20 are conserved 24 ' 34 . Functional over- 
expression of LHyal confirmed its identity as a novel HAase. In 
addition, the successful high-level expression of HAase is desirable 
for its broad application in clinical medicine 35,36 . Despite extensive 
study 3738 , BTH is the sole commercially available HAase 39 because 
the expression levels (activity) of most recombinant HAases are too 
low to meet practical demands. Here, by employing N-terminal 
engineering and fed-batch fermentation, a large amount of HAase 
was obtained, with a crude titre of 8.42 X 10 5 U ml" 1 , a value much 
higher than that of commercial BTH (1,005 U mg" 1 , 20-25% pure) 
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extracted from bovine testicle 40 . Compared with the acidic BTH 14 , the 
easily purified H6LHyal exhibits superior enzymatic properties (a 
near-neutral optimum pH of 6.5 and an optimum temperature of 
45 °C, with excellent temperature resistance). In addition, H6LHyal 
has superior substrate specificity (Fig. 3g) and has no transglycosi- 
dase activity. Consequently, H6LHyal is more suitable for medical 
applications. 

HAs of small molecular mass, particularly HA oligosaccharides 
composed of 4-18 sugar residues, have attracted much attention for 
their potential in medical applications 5 ' 41 " 43 , such as cancer therapy 
and nerve cell regeneration. Although efficient isolation and puri- 
fication approaches have been developed 15,44 " 46 , the preparation of 
HA oligosaccharides of specific lengths as hydrolysis products of 
bovine testicular or microbial HAases remains low-yield, and a broad 
spectrum of products is typically obtained (more than 20 different 
HA oligosaccharides) 18,29 . In contrast, the novel HAase H6LHyal that 
was identified and highly overexpressed in this work yields a narrow 
range of products (Fig. 4d), with glucuronic acid at the reducing end 
(Supplementary Fig. S3). This product specificity might be due to the 
preferential digestion of longer HA chains by H6LHyal (Fig. 4i) 
through a nonprocessive endolytic mode, similar to human HAase 
2 47 . Rapid degradation of the long HA chains (HA 10) and slow 
degradation of the short chains (HA8) resulted in the accumulation 
of shorter HA oligosaccharides (HA8, HA6 and HA4) (Fig. 4b, 4c). 
The results confirmed that LHyal is a hyaluronate 3-glycanohydro- 
lase (EC 3.2.1.36) without transglycosylation activity, consistent with 
previous reports 13,16,17 . Therefore, by controlling the incubation time, 
specific HA oligosaccharides could be produced in high yield, greatly 
facilitating downstream purification and improving productivity. 
The above results suggest that the high-yield novel H6LHyal has 
great potential for the preparation of specific HA oligosaccharides 
at the commercial scale. 

In conclusion, we successfully cloned the first leech hyaluronate 3- 
glycanohydrolase family (EC 3.2.1.26) gene and achieved its high- 
level heterologous overexpression. Compared with commercial 
BTH, this novel HAase has better substrate specificity and a vastly 
different hydrolysis mechanism. By controlling the incubation time, 
different HA oligosaccharides, particularly HA10, HA8, HA6 and 
HA4, can be selectively generated with high yields. The large-scale 
expression of H6LHyal and the enzymatic production of specific HA 
oligosaccharides are not only advantageous for the cosmetic, health- 
care and medical industries but will also enable polysaccharide 
chemical synthesis and cancer therapy research 48 . 

Methods 

Bacterial strains, plasmids and growth conditions. The E. coli strain DH5a 
(Stratagene, La Jolla, CA, USA) and pMD19 (Takara Biotech, Dalian, China) were 
used as the host and vector, respectively, to clone the HAase gene; the resulting 
plasmid was used for sequencing. E. coli were grown in Luria-Bertani (LB) medium at 
37°C, supplemented with 100 ug ml" 1 ampicillin. Pichia pastoris GS1 15 and pPIC9K 
(Invitrogen, Carlsbad, CA, USA) were used as the host and vector, respectively, for 
heterologous expression of the HAase. YPD medium (g l" 1 : yeast extract 10, peptone 
20 and glucose 20) was used for seed culture. Buffered minimal glycerol yeast medium 
(BMGY, yeast extract 10 g l" 1 , peptone 20 g 1~\ glycerol 10 ml l" 1 , 100 mM 
potassium phosphate, 0.34% yeast nitrogen base without amino acids [YNB], 4 X 
10" 5 % biotin, 1% ammonium sulphate) and buffered methanol minimal yeast 
medium (BMMY, yeast extract 10 g l" 1 , peptone 20 g l" 1 , 100 mM potassium 
phosphate, 0.34% YNB, 4 X 10" 5 % biotin, 1% ammonium sulphate, 1% methanol) 
were used for flask cultivation. For fermentation, basal salts medium (BSM) with the 
following composition was used (g l" 1 ): glycerol 40, K 2 S0 4 18, MgS0 4 *7H 2 0 14.9, 
KOH 4.13, H3PO4 27 and CaS0 4 -2H 2 0 0.93, with 4.4 ml l" 1 of PTM1 trace metal 
solution. PTM1 solution contained the following (g l" 1 ): CuS0 4 *5H 2 0 6, KI 0.09, 
MnS0 4 -H 2 0 3, H 3 B0 3 0.02, MoNa 2 0 4 -2H 2 0 0.2, CoCl 2 0.5, ZnCl 2 20, FeS0 4 -7H 2 0 
65 and biotin 0.2, with H 2 S0 4 5.0 ml l" 1 . For glycerol-fed batch cultivation and 
methanol-fed batch cultivation, 50% (w/v) glycerol and pure methanol containing 
12 ml L -1 PTM1 solution were used, respectively. 

Cloning of the HAase gene LHyal. All primers used in this study are provided in 
Supplementary Table SI. Total RNA was extracted from the heads of wild leeches 
using a tissue total RNA extraction kit (Hangzhou Biosci Co., Ltd, China.). cDNA was 



synthesised in a 20- ul reaction (M-M1V reverse transcriptase 1 ul, RNase Inhibitor 
1 5X first-strand buffer 4 ul, oligo(dT) 18 primers 1 ul, 10 mM each dNTP mix 
1 ul, total RNA 12 ul) using the M-M1V first-strand RT kit (Hangzhou Biosci Co., 
Ltd, China). The 3' end of the HAase gene (LHyal) was amplified with gene-specific 
degenerate primers (GSDF/GSDR), which were designed based on candidate 
hyaluronidase cDNAs (GenBank: FP628211.1 and JZ186329.1) similar to 
heparanases from a Hirudo medicinalis library in the EST database (http://www.ncbi. 
nlm.nih.gov/nucest/). PCRs were performed in 50- ul volumes using 5 ul of cDNA as 
the template, 20 pmol of each primer, 10 ul of PCR buffer, 0.25 mM dNTPs and 1 
unit of DNA Polymerase under the following conditions: 94°C for 3 min; 35 cycles of 
94°C for 30 s, 50-55°C for 30 s and 72°C for 1 min; and 72°C for 10 min. To obtain 
the 5' fragment of LHyal, first- strand cDNA synthesis was performed to generate the 
template using the SMART RACE cDNA Amplification Kit (CLONTECH) according 
to the manufacturer's instructions. 5 '-RACE PCR was performed as previously 
described using Universal Primer Mix (UPM) and a gene- specific antisense primer 
(GSPR) designed to the obtained 3 '-end sequence of LHyal. 

Analysis of the LHyal sequence. To obtain the maximum 5' sequence, the 5 '-ends of 
5 separate 5 '-RACE clones were sequenced. A putative 1470-bp open reading frame 
(ORF) corresponding to a potential protein-coding segment was identified based on 
the sequences of the 3'- and 5 '-ends, and primers (HAaseF and HAaseR) were 
designed to amplify the LHyal sequence. The PCR products were extracted from an 
agarose gel and cloned into the pMD19-T simple carrier for sequencing. Initial 
comparisons of the translated protein sequences to the protein database were 
performed using the BLASTP program (http://www.ncbi.nlm.nih.gov/blast/). To 
confirm its identity, the inferred amino acid sequence of the cDNA was aligned 
(CLUSTALW) against representative HAase sequences (acquired by BLAST searches 
of the genetic databases) from mammals, venomous insects and bacteria and 
mammalian heparanases. The theoretical structure of LHyal was obtained by 
homology modelling using the I-TASSER webserver 20 and the crystal structure of the 
glycoside hydrolase family 79 P- Glucuronidase from Acidobacterium capsulatum as a 
template 19 . The estimated accuracy quality of the modelled structure was 0.71 ± 0.1 1 
(TM- score). To analyse the evolutionary relationship of this leech HAase, a 
phylogenetic tree of LHyal sequences and known HAases and glycoside hydrolases 
from the GenBank database was constructed using MEGA 5.0. 

DNA manipulation. To construct LHyal-pPIC9K, the HAase-encoding gene was 
amplified with the primers LHyalF/LHyalR, which introduced EcoRI/NotI restriction 
sites at the 5' and 3' ends of the LHyal gene, respectively. The purified PCR product 
was digested with EcoRI/NotI and ligated into EcoRI/NotI- digested pPIC9K 
containing the a-factor signalling sequence, creating LHyal-pPIC9K. To produce the 
constructs in which various His tags were fused to the N-terminus of LHyal, the 
primers H4LHyalF/LHyalR, H6LHyalF/LHyalR, H9LHyalF/LHyalR and 
H12LHyalF/LHyalR were used to introduce 4, 6, 9 and 12 His residues at the N- 
terminus of LHyal to create H4LHyal-pPIC9K, H6LHyal-pPIC9K, H9LHyal-pPIC9K 
and H12LHyal-pPIC9K, respectively. The recombinant plasmids were transformed 
into chemically competent E. coli DH5a prepared using standard CaCl 2 methods. 
Unless otherwise stated, the recombinant plasmids were linearised with Sail and 
transformed into P. pastoris GS115 by electroporation as described previously 49 . 

Overexpression and high-cell-density fermentation of LHyal. Positive P. pastoris 
recombinants carrying LHyal were cultivated in YPD medium at 30°C. A 5-ml 
overnight culture was transferred to 50 ml of BMGY medium and incubated in 500- 
ml Erlenmeyer flasks rocking at 200 rpm at 30°C. When the yeast culture OD 600 
reached 4-6, the cells were collected and transferred to 50 ml of BMMY induction 
medium containing methanol (1% v/v) for 96 h at 30°C. 

High-cell-density fermentation was performed in a 3-L fermenter (LiFlus GM 
BioTRON, Korea) containing 800 ml of BSM supplemented with PTM1 trace salts 
solution 50 with the following parameters: temperature of 30°C, pH 5.5, agitation at 
500 rpm and air flow rate of 2.0 wm. Two 50-ml seed cultures were added to the 
fermenter, and when glycerol was exhausted, the fed-batch phase was initiated by 
feeding 50% (w/v) glycerol plus 12 ml l" 1 PTM1 at index-fed-batch rates of 13.5, 16.2, 
19.2, 22.8, 27.2 and 32.4 ml h" 1 L" 1 for the first 6 h. For the subsequent 6 h, the 
culture was fed at a constant fed-batch rate of 30 ml h" 1 1" 1 . The methanol induction 
phase was initiated at 25 °C and pH 5.5 with agitation at 1,000 rpm by feeding 
methanol containing 12 ml l" 1 PTM1 at a concentration of 1.8% (v/v) for 3-5 h after 
the glycerol was exhausted. The methanol feed rate was controlled with a methanol 
sensor (FC-2000, Superlnfo. Tech. Co. Ltd., Shanghai, China). 

Purification of the LHyal protein. The fermentation supernatant and Ni-NTA 
agarose (Qiagen, Hilden, Germany) were loaded onto a gravity- flow column and 
incubated for 4 h at 4°C. They were then washed with a stepwise gradient of imidazole 
(10-60 mM) in phosphate buffer to remove contaminating proteins. The bound N- 
terminal His-tagged protein was eluted from the column with phosphate buffer 
containing 500 mM imidazole, dialysed against a gradient of sodium chloride 
concentrations (300, 100 and 0 mM) in 50 mM phosphate buffer pH 6.0 at 4°C for 
24 h. The dialysed protein was lyophilised in a Christ Alpha 1-5 instrument (Christ, 
Osterode, Germany). The purified enzymes were analysed by SDS-PAGE and stained 
with Coomassie Brilliant Blue G-250. The concentration and total amount of protein 
were determined according to the Coomassie Brilliant Blue R250 assay method. 
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Enzymatic characterisation of H6LHyal. HAase activity was determined using the 
simple plate assay of Richman et al 51 . The assay plate was prepared with 1 mg ml" 1 
HA, 1.5% agarose, 50 mM sodium citrate (pH 5.5) buffer, 0.15 M NaCl and 0.02% 
(w/v) sodium azide. The fermentation was emptied into the cylindrical holes (3 mm 
in diameter) on the agarose plates, incubated at 37°C for h and covered with 10% 
(w/v) cetylpyridinium chloride. The formation of a distinct clear halo around the hole 
indicated HAase activity. Leech HAase activity was quantified by measuring the 
amount of reducing sugar liberated from HA with the 3,5-dinitrosalicylic acid (DNS) 
colorimetric spectrophotometric method 52 . One unit of enzymatic activity was 
tentatively defined as equal to the reducing power of glucuronic acid (glucose 
equivalents in micrograms) liberated per hour from HA at 38°C and pH 5.5. Specific 
activity was defined as units of enzyme per mg of protein. Unless otherwise indicated, 
the standard enzymatic reaction contained 1.6 mg ml" 1 of HA as the substrate and an 
appropriate amount of H6LHyal in 50 mM pH 5.5 citrate buffer in a total volume of 
1 ml and was incubated at 38°C for 10 min. The reaction was stopped by immersion 
in boiling water for 5 min and then examined using the DNS method. Controls 
without enzyme were prepared and analysed in an identical manner. The substrate 
specificities of H6LHyal and BTH were determined using heparin, chondroitin 
sulphate and chitin. N-acetyl-hexosamine moieties at the reducing ends of 
oligosaccharides liberated from HA were determined using the Morgan-Elson 
reaction. 

Preparation and analysis of low-molecular-weight HA oligomers. High- 
molecular- weight HA was prepared at a concentration of 2 mg ml" 1 in 50 mM citrate 
buffer (pH 5.5). The reaction mixture, which contained 1.6 mg of HA and 640 U of 
HAase per ml, was incubated under standard conditions for 1, 2, 4 and 8 h. The 
hydrolysis of HA by commercial BTH and the resulting products were investigated 
under the same conditions. The assays were heated in boiling water for 5 min to 
terminate the reaction, filtered through a 0.22- urn filter and analysed with LCMS-IT- 
TOF. A 10- ul aliquot of the sample solution was introduced into the ESI ion source of 
the mass spectrometer via the autosampler of an HPLC system equipped with a C 18 
column (150 X 2.0 mm; Shimadzu, Kyoto, Japan). The mobile phases were water and 
acetonitrile, and a 0-20 min linear gradient of 0-12% acetonitrile under a constant 
flow rate (0.2 ml min" 1 ) was used. Mass spectrometry was performed in negative ion 
mode with scanning over the m/z range from 100-1,500 at 10 s/scan. Multi-stage 
mass spectrometry was performed under the same operating conditions. 
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